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We present a method for measuring the cosmic matter budget without assumptions about spec- 
ulative Early Universe physics, and for measuring the primordial power spectrum Pt{k) non- 
parametrically, either by combining CMB and LSS information or by using CMB polarization. Our 
method complements currently fashionable "black box" cosmological parameter analysis, constrain- 
ing cosmological models in a more physically intuitive fashion by mapping measurements of CMB, 
weak lensing and cluster abundance into fc-space, where they can be directly compared with each 
other and with galaxy and Lya forest clustering. Including the new CBI results, we find that CMB 
measurements of P{k) overlap with those from 2dF galaxy clustering by over an order of magnitude 
in scale, and even overlap with weak lensing measurements. We describe how our approach can 
be used to raise the ambition level beyond cosmological parameter fitting as data improves, testing 
rather than assuming the underlying physics. 



I. INTRODUCTION 

What next? An avalanche of measurements have 
now lent support to a cosmological "concordance model" 
whose free parameters have been approximately mea- 
sured, tentatively answering many of the key questions 
posed in past papers. Yet the data avalanche is showing 
no sign of abating, with spectacular new measurements 
of the cosmic microwave background (CMB), galaxy clus- 
tering, Lyman a forest (LyaF) clustering and weak lens- 
ing expected in coming years. It is evident that many 
scientists, despite putting on a brave face, wonder why 
they should care about all this new data if they already 
know the basic answer. The awesome statistical power of 
this new data can be used in two ways: 

1. To measure the cosmological parameters of the con- 
cordance model (or a replacement model if it fails) 
to additional decimal places 

2. To test rather than assume the underlying physics 

This paper is focused on the second approach, which has 
received less attention than the first in recent years. As 
we all know, cosmology is littered with "precision" mea- 
surements that came and went. David Schramm used to 
hail Bishop Ussher's calculation that the Universe was 
created 4003 b.c.e. as a fine example — small statistical 
errors but potentially large systematic errors. A strik- 
ing conclusion from comparing recent parameter estima- 
tion papers (say [1-4] by the authors for methodologically 
uniform sample) is that the quoted error bars have not 
really become smaller, merely more believable. For in- 
stance, a confidence interval for the dark energy density 
that would be quoted three years ago by assuming that 
four disparate data sets were all correct [1] can now be 
derived from CMB + LSS power spectra alone [4-7] and 
independently from CMB + SN la as a cross-check. 
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FIG. 1. Measurements of the linear matter power spectrum 
P{k) computed as described in the text, using the concordance 
model of [5] (solid curve) to compute window functions. The loca- 
tions of the CMB points depend on the matter budget and scales 
with the reionization optical depth as e^^ for k > 0.002. Correcting 
for bias shifts the 2dF galaxy points [8] vertically [b = 1.3 assumed 
here) and should perhaps blue-tilt them slightly. The cluster point 
scales vertically as (Hm/O.S)"'^ '^ , and its error bars reflects the 
spread in the literature. The lensing points are based on [9]. The 
LyoF points are from a reanalysis [10] of [11] and have an overall 
calibration uncertainty around 17%. 

This paper aims to extend this trend, showing how 
measurements can be combined to raise the ambition 
level beyond simple parameter fitting, testing rather than 
assuming the underlying physics. Many of the dozen or 
so currently fashionable cosmological parameters merely 
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parametrize two cosmological functions [12,13]: the cos- 
mic expansion history a{t) and the cosmic clustering his- 
tory P{k,z), the observables corresponding to 0th and 
1st order cosmic perturbation theory, respectively. This 
means that non-parametric measurements of these cos- 
mological functions allows testing whether the assump- 
tions associated with the cosmological parameters are in 
fact valid. Moreover, if there are discrepancies, compar- 
ing measurements of these functions from different data 
sets reveals whether the blame lies with theory, data or 
both. 

We will limit our treatment to the 1st order func- 
tion, P{k,z), since the 0th order function has been ex- 
tensively discussed previously [12-14]. One of the key 
ideas of this paper is summarized in Figure 1, showing 
how CMB, LSS, clusters, weak lensing and LyaF all con- 
strain P{k, z) at z = 0. The first plots that we are aware 
of showing CMB in k-space go back a decade [15], when 
CMB merely probed scales much larger than accessible 
to large-scale structure measurements. Since then, CMB 
has gradually pushed to smaller scales with improved an- 
gular resolution while LSS has pushed to larger scales 
with deeper galaxy surveys. What is particularly excit- 
ing now, and makes this paper timely, is that the two 
have met and overlapped, especially with the CBI ex- 
periment [16] and the 2dF [17] and SDSS [18] redshift 
surveys. Figure 1 shows that CMB now overlaps also 
with the scales probed by cluster abundance and even, 
partly, with weak lensing. 

P{k,z) can be factored as the product of a primor- 
dial power spectrum P,:{k) and a transfer function, cor- 
responding to the physics of the Early Universe and the 
Late Universe, respectively*. The two involve completely 
separate physical processes and assumptions that need to 
be tested, and the purpose of our method is to measure 
these two factors separately using observational data. 
Given a handful of cosmological parameters specifying 
the cosmic matter budget and the reionization epoch, 
the transfer function can be computed from first prin- 
ciples using well-tested physics (linearized gravity and 
plasma physics at temperatures similar to those at the so- 
lar surface) . The primordial power spectrum is on shakier 
ground, generally believed to have been created in the 
Early Universe at an energy scale never observed and in- 
volving speculative new physical entities. Most work has 
parametrized this function as a power law P^ (fc) oc fc" 
or a logarithmic parabola P^,{k) cx ^"+"1"^^ inspired by 
the slow-roll approximation in inflationary models [19], 
usually with a = 0. More general parametrizations have 
included broken power laws [20-23] a picccwisc constant 
function [24] and other forms [25,26]. It has also been 
shown [24] that the MAP CMB data [27] in combina- 



*Wo will assume that the primordial fluctuations are adia- 
batic, discussing the most general case in Section IV. 



tion with SDSS power spectrum measurements should 
be able to constrain the shape of P„{k) in considerable 
detail. The key challenge is breaking the degeneracy 
between the two factors, P«(A:) and the transfer func- 
tion. Although a future brute-force likelihood analysis 
parametrizing P, (fc) with, say, 20 parameters would be 
interesting and perfectly valid, it would obscure the sim- 
plicity of the underlying physics. Such a "black box" 
approach would entail computing many different curves 
for each point in parameter space (such as Ce for CMB, 
P{k) for galaxies, the aperture mass function Mapp(^) 
for lensing and the cluster mass function), and mapping 
out the 20-dimensional likelihood function numerically 
by marginalizing over other cosmological parameters like 
those of the matter budget. This would be overkill, since 
(modulo nonlinearity complications treated below) all 
measurements shown in Figure 1 can be recast directly 
as weighted averages of P^ (k) . 

The rest of this paper is organized as follows. In Sec- 
tion II, we describe the construction of Figure 1, ex- 
plaining how CMB, weak lensing and cluster abundance 
measurements can be mapped into (linear) fc-space. In 
Section III, we turn to the degeneracy between P*(fc) 
and cosmological parameters such as the various matter 
densities, and present our method for breaking it. Wo 
show how this allows measuring the cosmic matter bud- 
get without assuming anything about P«(A;) and obtain- 
ing a non-parametric measurement of P*(A;). 

II. MEASURING P{k) WHEN THE TRANSFER 
FUNCTIONS ARE KNOWN 

In this section, we discuss how measurements of CMB, 
weak lensing, cluster abundance, LyaF and galaxy clus- 
tering probe P{k) and P*(fc) when the relevant transfer 
functions are known. We will see that in all five cases, 
each measured data point di (a CMB band power ST^, a 
lensing aperture mass variance (Mapp(^)^), etc.) can be 
written as an integral 

/oo 
Vi{k)d\nk (1) 
-oo 

over (linear) wavenumber k for some non-negative inte- 
grand Vi{k). Renormalizing Pj(fc) to be a probability 
distribution, our convention (★) in Figure 1 is the follow- 
ing: 

(★) Plot the data point at the k-value corresponding to 
the median of this distribution with a horizontal bar 
ranging from the 20th to the 80th percentile. 

These percentiles correspond to the full-width-half- 
maximum for the special case of a Gaussian distribution. 
In other words, the horizontal bars indicate the range 
of scales k contributing to the data point. All trans- 
fer functions in this section are computed assuming the 
flat ACDM concordance model of [5] — we return the 
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more general case in the next section. This is a flat, 
scalar, scale-invariant model with cold dark matter den- 
sity h'^^m — 0.12, baryon density h'^ilb = 0.021 and cos- 
mological constant flA = 0.71. This corresponds to a 
Hubble parameter h = 0.70. We chose a reionization op- 
tical depth T = 0.05, which corresponds to reionization 
at redshift z = 8 with these parameters. We normalize 
the model to have as = 0.815, which provides a good fit 
to the CMB data. 



A. CMB data 

Figure 2 shows all 119 CMB detections currently avail- 
able, extending the compilation in [4] by adding the new 
measurements from the Very Small Array (VSA) [28] and 
the Cosmic Background Imager (CBI) mosaic [16]. Re- 
cent data reviews include [29,30]. 
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FIG. 2. CMB data used in our analysis. Error bars in the plot 
do not include calibration or beam errors which allow substantial 
vertical shifting and tilting for some experiments (these effects were 
included in our analysis). 
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FIG. 3. Combination of data from Figure 2. These error bars 
include the effects of beam and calibration uncertainties, which 
cause long-range correlations of order 5%-10% over the peaks. 
In addition, points tend to be anti-correlated with their nearest 
neighbors, typically at the level of a few percent. The horizontal 
bars give the characteristic widths of the window functions (see 
text). The curves show the fiat ACDM concordance models from 
[4] (green/light gray) and [5] (blue/dark gray). 

We combine these measurements into a single set of 25 
band powers shown in Figure 3 and Table 1 using the 
method of [4] , including calibration and beam uncertain- 
ties, which effectively calibrates the experiments against 
each other. The coefhcients yi in equations (A2), (A4) 
and (A5) of [4] should strictly speaking be the window- 
convolved true power spectrum, so rather than approx- 
imating them by the observed data points as in [4], we 
approximate them by the the smooth fit to the data given 
by the green/light gray curve in Figure 3 convolved with 
the experimental window functions. We have excluded 
the PythV data since it disagrees with numerous other 
experiments on large angular scales and the non-release 
of the underlying map precludes clarifying this situa- 
tion. Since our compressed band powers de are simply 
linear combinations of the original measurements, they 
can be analyzed ignoring the details of how they were 
constructed, being completely characterized by a window 
matrix W: 



(2) 



where Ce = ST^ = £{i + l)Ci/2Tr and d is the an- 
gular power spectrum. This matrix is available at 
www .hep.upenn.edu/ '^max / cmb / experiments .html to- 
gether with the 25 band powers di and their 25 x 25 
covariancc matrix. Following the convention used in Fig- 
ure 1, the data values and effective ^-ranges in Figure 3 
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and Table 1 correspond to the median, 20th and 80th 
percentile of the window functions W. (We use abso- 
lute values of the window function to be pedantic, since 
some windows go slightly negative in places as a result of 
the inversion, although this makes a negligible difference 
for the plot.) Comparing Table 1 with the older results 
from [4], we find that the degree-scale normalization is 
marginally higher. In bin 8, for instance, corresponding 
to the 1st peak, the normalization has risen by 3% due 
to the inclusion of the VSA and CBI results and a fur- 
ther 6% due to the above-mentioned improved modeling 
of calibration and beam errors. With the old modeling, 
a measurement scattering low by chance would be as- 
signed a smaller error and therefore get more statistical 
weight, pulling the overall calibration down somewhat. 
A detailed discussion of calibration issues can be found 
in [31]. 



Table 1 - Band powers combining the information from CMB data 
from Figure 2. The 1st column gives the £-bins used when com- 
bining the data, and can be ignored when interpreting the results. 
The 2nd column gives the medians and characteristic widths of the 
window functions as detailed in the text. The error bars in the 
3rd column include the effects of calibration and beam uncertainty. 
The full 25 X 25 correlation matrix and 25 X 2000 window matrix are 
available at www. hep. upenn.edu/ ^ max / cmb / experiments .html . 
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The angular power spectrum of the CMB is determined 
by the primordial power spectrum P*{k) through a linear 
relation 



/oo 
We{k)P4k)d\nk, (3) 
-C30 

where the transfer functions We{k) depend on the cosmic 
matter budget and the reionization optical depth. Since 

P{k)^T{kfP,{k) (4) 

where T{k) is the matter transfer function, equation (3) 
can be reexpressed directly in terms of the current power 
spectrum: 

Ct^ p^^^Pik)d\nk. (5) 
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FIG. 4. The curves kW£{k) whose integral give Cf for a 
scale-invariant spectrum, all rescaled to have unit area. From left 
to right, the curves are for multipoles £ = 2, 4, 8, 16, 32, 64, 128, 
256, 512, 1024. 

Equation (3) implies that for the CMB with data 
points de = Ce, the integrand of equation (1) is simply 
■Pf(fc) = Wi{k) P^, (k) . We compute the integral kernels 
Wi{k) with CMBfast [32]. Figure 4 shows Viik) for a 
sample of ^-values, normalized to integrate to unity, for 
a scale-invariant primordial power spectrum P*(A;) oc k. 
In other words, the figure simply shows kWi{k) rescaled, 
the integral of which gives Ci. For each such curve, we 
compute the 20th, 50th and 80th percentile as per the 
above-mentioned convention (*) and plot the results in 
Figure 5 as an indication of what fc-range is probed by 
each multipole £. The situation is completely analogous 
for the polarization case. The relations between i and k 
are seen to be roughly linear as expected, and to tighten 
with increasing £. 

The slight wiggles roughly line up with the derivatives 
of the three CMB power spectra. This is because when 
the power spectrum is steeply rising, the contribution 
will be larger from the peak on the right than from the 
trough on the left, pushing the median up towards higher 
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k, and vice versa. These wiggles are seen to be more pro- 
nounced for E-polarization than for the unpolarized case. 
This is because the wiggles are sharper and have greater 
relative amplitude for the polarized case, increasing the 
magnitude of the derivative. The T-spectrum has milder 
wiggles since the peculiar velocity contribution fills in 
the troughs between the peaks from the dominant den- 
sity/gravitational contribution — the E-power spectrum 
has only a velocity contribution and thus drops near zero 
between peaks, staying positive only because of geometric 
projection effects in the mapping from /c-space to ^-space. 



where 

W,{k) = Y,^^eWe{k). (7) 

In other words, each of our 25 binned CMB measure- 
ments probes a known linear combination of the primor- 
dial power spectrum. A sample of these new window 
functions are plotted in Figure 6, and are again seen to be 
quite narrow for large £. Indeed, although the £-smearing 
makes these windows slightly broader than those in Fig- 
ure 4, it is also seen to make them more well-behaved, 
eliminating the high-frequency oscillations at large i. 
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FIG. 5. The correspondence between f-space and fc-space for 
CMB. For each the shaded bands indicates the fc-range from 
the 20th to 80th percentile of the distribution kW({k) (Figure 4), 
and the black curve shows the median. From top to bottom, the 
three bands are for the E-polarization, cross-polarization (X) and 
unpolarized (T) cases, respectively. To avoid clutter, the E and X 
bands have been multiplied by 10 and 100, respectively. 

Because of incomplete sky coverage, real-world CMB 
measurement can never measure individual multipoles i, 
merely weighted averages of many. Substituting equa- 
tion (3) into equation (2) gives 

/oo 
W,{k)P,{k)d\nk, (6) 
-oo 
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FIG. 6. Sample curves kWi{k) whose integral give our binned 
CMB band powers from Figure 3 for a scale-invariant spectrum, all 
rescaled to have unit area. From left to right, the curves correspond 
to band powers i = 1, 2, 3, 6, 9, 14, 19, 24. 

We are now ready to map our CMB measurements 
from Figure 3 into fc-space. We need a prescription 
for where to position the points both horizontally and 
vertically. Horizontally, we simply follow the above- 
mentioned convention (★) and plot it at the median of 
the distribution Vi{k) — kWi{k) from Figure 6, with hor- 
izontal bars extending from the 20th to the 80 percentile. 
Vertically, we plot it at the value P*j defined by 

^ jr^kW-{k)d\nk- 

Taking the expectation value of this and using equa- 
tion (6) tells us that we can interpret P,^ as measuring 
simply a weighted average of k~^P^{k) (which we ex- 
pect to be a nearly constant function), with the window 
function kWi{k) giving the weights. The resulting 25 
measurements of P*{k) are shown in Figure 7, and Fig- 
ure 8 shows a simulation for measurements by the MAP 
satellite [27]. To plot these points as measurements of 
P(fc), we proceed analogously. We use exactly the same 
convention (★) for the horizontal placement of the points. 



and given equation (4) plot them at a vertical position 
given by 



P = T{k,sfP* 



(9) 



where k^s is the horizontal location of the point (the me- 
dian of the window function). This allows us to interpret 
Pi/Pfid as measuring simply a weighted average of the 
relative power P(fc)/Pfid(fc), where Pfid(^) = kT{k)'^ is 
our fiducial power spectrum. This procedure produces 
the CMB points plotted in Figure 1 and Figure 9. 
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FIG. 7. Measurements of the primordial power spectrum P*{k) 
computed as described in the text, divided by our fiducial primor- 
dial power spectrum Ak with A ^ (43.2Mpc//i)*. This normaliza- 
tion corresponds to an rms fluctuation in the gravitational potential 
■4> given by [fc3p^(fe)/27r2]i/2 = 2.9 X 10-'\ The data are the same 
as in Figure 1, divided by AkT{k)'^. 
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FIG. 8. Same as Figure 7, but for simulated future data. CMB 
and galaxy points assume the advertized specifications for the com- 
plete MAP and SDSS datasets, respectively. The CMB simulation 
assumes the MID foreground model of [33] . Still better galaxy mea- 
surements are likely to result adding the SDSS luminous red galaxy 
survey [34] and photometric redshift information. The cluster point 
assumes a relative error of 7.5% on a-{R), i.e., that systematic un- 
certainties can be reduced to the level of current statistical errors. 
The lensing and LyaF points assume that the current errors have 
been cut in half. 
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FIG. 9. Same as Figure 8, but for the current linear power 
spectrum P{k). In other words, this is Figure 1 for simulated future 
data. 
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FIG. 10. Curves show ^"'(fc) (top) and "P' (bottom), normal- 
ized to have unit area. These curves show how much different non- 
linear (top) and linear (bottom) scales contribute to the observed 
weak lensing aperture mass variance (Mapp(S)^) for the source red- 
shift distribution of [91 . 



B. Weak lensing data 

Weak gravitational lensing uses photons from distant 
galaxies as test particles to measure the metric fluctua- 
tions caused by intervening matter, as manifested by dis- 
torted images. The first detections of this cosmic shear 
signal [35,36] were reported in 2000 [37-42], and dramatic 
improvements are likely to lie ahead just as for CMB ob- 
servations. For this paper, we will use the results from 
the Red-Sequence Cluster Survey (RSCS) reported in [9], 
which includes data from a record-breaking 53 square de- 
gree sky area. We use the seven data points employed for 
the cosmological analysis in [9], i.e., 



(10) 



where 9 =10', 15', 20', 25', 35, 33, 42', 50'. This mea- 
sured quantity, denoted the aperture mass variance, is on 
average given by 
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(11) 



where J4 is a Bessel function and Pk(^) is the cosmic 
shear power spectrum. The shear power spectrum in turn 
is given by a linear combination of the nonlinear matter 
power spectrum Pn\ikni) over a range of wavenumbers fcni 
and redshifts [35,36], 




10 100 1000 

Angle 5[arcmin] 

FIG. 11. Ranges of k that contribute to Mapp{9)'^ as a function 
of 6. The line indicates the median k and the boundaries correspond 
to the 20th and 80th percentile. The upper (lower) band shows the 
range for nonlinear (linear) scales. 
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where we have introduced the comoving radial coordinate 
Lo and ujh corresponds to the horizon. Here fxi^) is the 
angular diameter distance and W{io) is a source-averaged 
ratio of angular diameter distances. For a given redshift 
distribution of the sources P6(ti^), 
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We use the best fit redshift distribution for the RSCS 
sample from [9], 
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with {a,l3,Zs) = (4.7,1.7,0.302). With a change of vari- 
ables to fcni = 1/ fxi^) we can rewrite equation (11) in 
the form of 



di — 



nl 



K'(fcni)dlnfcnl 



27rc4 



V a{uj) 



12J4{KlfK{uj)0,) 



duo. 



(15) 



7 



The integrand 7'f'(fcni) is an integral over cosmic time 
that depends linearly on the nonlinear matter power 
spectrum Pn\{kni) at various redshifts. The upper panel 
in Figure 10 shows this integrand for a sample of angular 
scales 8. Just as for the CMB, we follow our convention 
(★) and compute the 20th, 50th and 80th percentiles of 
these distributions. The results, plotted in Figure 11, 
show that we approximately have k oc 1/0 as expected 
but that the relation is not particularly tight, with a given 
6 probing a broad range of A;- values. 

To use this relation between k and 9 to map the lens- 
ing data into (linear) fc-space would be quite misleading, 
since the nonlinear power -Pni(fcni) is the result of gravi- 
tational collapse and therefore carries information about 
the linear power on larger spatial scales [43-45] . We will 
use the Ansatz of Peacock & Dodds [45] to quantify this 
effect. Defining 

^(''^=(1^^^^^' ^nl(fcnl) = (^i^nl(M, (16) 

the linear power A on scale k is approximately related to 
the nonlinear power Ani on a smaller nonlinear scale fcni, 



Anl(fcnl) = .fNL{A{k)), 

fc„i = (l + A„i)'/'fc, 



(17) 
(18) 



where fNL{x) is a fitting function that depends on 
both the cosmology and the slope of the linear power 
spectrum.'^ A few caveats about the Peacock & Dodds 
approximation arc in order. It was developed to fit sim- 
ulations of power law spectra, so it can disagree signifi- 
cantly with N-body results when considering power spec- 
tra that are not pure power laws (as is the case here) or 
have wiggles [46-48] . The straight mapping between the 
non-linear power spectrum at one scale and the linear 
power spectrum at a larger scales is only approximate, 
so care should be taken when interpreting our transla- 
tion from aperture mass to P{k). 

In the Peacock & Dodds Ansatz, k determines A which 
determines Ani, which via equation (18) determines fcni. 
This means that we can think of both k^i and Vj in equa- 
tion (15) as functions of the linear wavenumber k and 
change variables in the integral: 



di 



Vl{k)d\nk. 



(19) 



The relation between fcni and k is time dependent, so 
the Jacobian of the transformation cannot be taken out 
of the time integral. The functions 'P\{k) tell us which 
linear scales k contribute to the observed lensing signal. 



^We used a simple F-model fit to the power spectrum (ie. 
a fit with no wiggles) to calculate the slope needed in the 
Peacock and Dodds Ansatz. 



They are plotted in the bottom panel of Figure 10 for 
the seven data points, and their fc-range is shown in as a 
function of 6 in Figure 11. 

We see that the curves P"' and differ dramatically 
on small scales. Not only do the lensing measurements 
probe the linear power spectrum on much larger scales 
scales kr^ than those on which it probes Pni, but the 
fc-range probed is substantially narrower as well. The re- 
lation between 6 and (median) k can be approximated by 
a simple power law with half the slope over the range of 
scales shown in Figure 11, fc ~ (^/3')~'^/^/i/Mpc, with 
the two bands converging only for > 3° where the 
density fluctuations are nearly linear. The implications 
of this are twofold: weak lensing probes P{k) on sub- 
stantially larger scales than a naive back-of-the-envelope 
calculation would suggest, and the fc-space window func- 
tions on P(fc) are quite nice and narrow, facilitating cos- 
mological interpretation of the measurements, although 
the caveats about the Peacock & Dodds Ansatz should 
be born in mind. The last remaining subtlety involved 
in mapping our lensing measurements into (fc, P)-space 
concerns the vertical placement of the points and their 
error bars. Since V\{k) depends on P{k) in a nonlinear 
way, we cannot simply proceed as in the CMB case, in- 
terpreting di as measuring a weighted average of P{k). 
We therefore need to construct a relation between P{k) 
and d{6) around the fiducial model Pfid(fc) (which fits 
the measured data from [9] well). To do so, we compute 
the aperture mass for models with a varying overall nor- 
malization of P{k). As seen in Figure 12, the relation 
between the aperture mass and this overall normaliza- 
tion is well approximated by a straight line in log-log 
space whose slope depends only on 9, so we make the 
approximation 



In 



P(fccff) 



-f{9) In 



f djO) \ 

Ufid(^); 



(20) 



To translate the error bars, we simply multiply the rela- 
tive error in the aperture mass variance measurement by 
7(^) to obtain the relative error in P{k). 

Finally, although by construction we are always map- 
ping the constraints of the different measurements onto 
the linear power spectrum at the present epoch, the lens- 
ing aperture mass measurements are actually sensitive to 
a weighted average of the power spectrum over redshift, 
with a weight that peaks somewhere midway between 
redshift zero and the redshift of the background galaxies. 
Just as before we can write, 



di= r 

Jo 



V!ik)dz, 



(21) 



where z is redshift. For completeness we show these inte- 
grands in figure 13. These functions are seen to be very 
broad and to depend only weakly on the angular scale 0. 
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FIG. 12. Relation between the aperture mass variance and the 
overall normalization of the matter power spectrum (cr^). Both val- 
ues are normalized to the values for the fiducial model. Clockwise, 
the curves correspond to =10, 15, 20, 25, 33, 42 and 50 respec- 
tively. Our approximation that these curves are straight lines is 
seen to be quite accurate. Their slope approaches unity (dotted 
line) in the linear regime (for very large d). 
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FIG. 13. Curves show 'P^{z) normalized to have unit area. 
These curves show where the contribution to the aperture mass 
originates in redshift for each of the seven values of 6 from [9]. 



C. Cluster data 

The abundance of galaxy clusters at various redshifts 
is emerging as an increasingly powerful probe of cosmo- 
logical parameters, as new surveys are enlarging cluster 
samples and x-ray, SZ, optical and lensing observations 
of cluster properties are improving our understanding of 
the underlying physics. 

In principle, a suite of hydrodynamical simulations in- 
cluding all the relevant physics could be used to map out 
the region in cosmological parameter space that matched 
the observed cluster abundance. In practice, this is still 
not numerically feasible, so published constraints involve 
a series of approximations. At a minimum, this tends to 
involve the Press-Schechter approximation or variations 
thereof [49-51] to predict the mass function of dark ha- 
los and some way of inferring the mass of the dark halo 
from the observed properties of the cluster. For exam- 
ple, in studies using X-rays, a mass-temperature relation 
that connects the halo mass with an observed cluster x- 
ray temperature is needed. The consensus result is that 
cluster data constrains mainly a combination of the nor- 
malization of the power spectrum on the cluster scale and 
the cosmic density parameter r2,„. The normalization is 
usually quoted as the rms density fluctuation a{R) in a 
sphere of radius R = 8ft.^^Mpc, given by 



47r 
(2^ 







J — oo 





k^P{k)d\nk, (22) 



where the 1st spherical Bessel function is ji{x) = (sinx — 
xcosx)/x'^. 

For instance, a recent SDSS analysis reports as — 
(0.35±0.03)il~"-^*' [52], basing cluster mass estimates on 
richness rather than x-ray. However, it has been argued 
[53] that quoting results using R = 8/i~^Mpc is confus- 
ing, since the cluster abundance is mainly sensitive to 
slightly larger scales centered around R ~ 15/i~^Mpc. 
The ilm-dependence above comes mainly from the fact 
that the mean density f2„i enters in the Press-Schechter 
formula and collapse overdensity approximation, but also 
from a small r^m-dependent correction for evolution be- 
tween z = and the redshifts observed (say z = 0.1—0.2). 
Including the additional rJm-dependence coming from the 
fact that ri,„ affects the shape of the power spectrum and 
hence the ratio ratio (Tis/ctsj a result like as (x. ^'^'^ 
changes significantly, to something Hke as (x 51^°'^ [53]- 

Since we wish to plot constraints on the power spec- 
trum P{k) of the form of equation (1), we follow [53] and 
use the normalization an at R = 911~°'^^ « 15/i~^Mpc. 
This scale corresponds to a 6.5keV cluster forming now, 
and has the property of giving constraints that to first 
approximation depend on P{k) only via its normalization 
(afj), not via its shape [53]. Using our convention (★) to 
plot the fc-range probed by clusters, the 20th, 50th and 
80th percentiles of the integrand in equation (22) fall at 
k w0.06, 0.10 and 0.15/i/Mpc, respectively. 
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Tabic 2 gives a recent sample of chistcr measurements 
of the power spectrum normalization, all quoted for 
firn = 0.3 for comparison. We see that although the 
quoted error bars are as small as 0.05-0.08, the spread 
in (Tg between papers is many times larger, as great as 
0.41 even during the past year. Since this suggests that 
systematic uncertainties are still larger than statistical 
uncertainties, we simply use the constraint as = 0.8 ±0.2 
to be conservative, mapped to i? = 15/i^^Mpc as in [53] 
to reduce power spectrum shape dependence. 

Table 2 - Recent measurements of the P{k) normalization using 
cluster abundances. 



other details such as the reionization history) are sufB- 
ciently small. 

Figure 1 shows the reanalyzed data [10] with the 
quoted statistical and "systematic" errors added in 
quadrature. The plotted errors do not include an overall 
multiplicative error of 17% stemming from temperature 
and optical depth uncertainties, and the mapping from 
the observation redshift z ~ 2.72 to today may intro- 
duces additional horizontal and vertical shifts that de- 
pend on fijji and as described in Section 111. We use 
the approximation of [10] that the window functions are 
Gaussian with width Ak ss 25 km/s (fc x km/s)~-^/^. The 
13th point is a mere upper limit, omitted to avoid clutter. 



Analysis erg 

Picrpaoli et al. (2001) [53] 1.02^008 

Borgani et al. (2001) [54] 0.76^0 05 

Reiprich & Bohringer (2001) [55] O.eslQgg 

Seljak et al. (2001) [56] 0.75 ± 0.06 

Viana et al. (2001) [57] 0.61 ± 0.05 

Bahcall et al. (2002) [52] 0.72 ± 0.06 

D. Lya Forest data 

The Lyman a forest (LyaF) is the plethora of absorp- 
tion lines in the spectra of distant quasars caused by neu- 
tral hydrogen in overdense intergalactic gas along the line 
of sight. By tracing the cosmic gas distribution out to 
great distances, it offers a new and exciting probe of mat- 
ter clustering on even smaller scales than currently acces- 
sible to CMB and weak lensing, when the universe was 
merely 10-20% of its present age. Since the gas probed by 
the LyaF is only overdense by a modest factor relative to 
the cosmic mean, the hope is that all the relevant physics 
can be simiilated. thereby connecting the observations to 
the underlying matter power spectrum [58-62]. 

The most ambitious such analysis to date [11] claimed 
to do just this, measuring P{k) on 13 separate scales k 
using 53 quasar spectra. An extensive reanalysis by an 
independent group [10] has suggested that the technique 
basically works. One should keep in mind that there are 
many caveats to the Lyman a forest analysis. One won- 
ders to what extent all the relevant physics is included in 
the hydro-simulations and the dark-matter-only prescrip- 
tions that have been developed and how the uncertain- 
ties in the reionization history, the ionizing background 
and its fluctuations propagate into the reconstruction of 
P{k). Moreover, even for the evolution of the dark mat- 
ter alone, which is the basis of the simple Ansatz used to 
determine P{k) from the LyaF data, non-linear correc- 
tions significantly affect the evolution of clustering on the 
scales relevant for the LyaF because the slope of the P{k) 
around the non-linear scale is much closer to rie ff = — 3 
than it is today [63]. We should view the reconstructed 
points as an inversion done assuming that all the relevant 
physics was correctly modeled and that the departures 
from the fiducial model (which in this case also involve 



E. Galaxy clustering data 

Two- and three-dimensional maps of the Universe pro- 
vided by galaxy redshift surveys constitute the P{k) 
probe with the longest tradition. Indeed, the desire to 
measure P{k) was one of the prime motivations behind 
ever more ambitious observational efforts such as the the 
CfA/UZC [64,65], LCRS [66] and PSCz [67] surveys, each 
well in excess of 10^ galaxies. The most accurate power 
spectrum measurement to date is from the 2 Degree Field 
Galaxy Redshift Survey (2dFGRS) [17], soon to be over- 
taken by the Sloan Digital Sky Survey (SDSS) [18] which 
aims for 1 million galax;ies. 

Band powers measured from galaxy surveys are related 
to the underlying matter power spectrum by 

/oo 
Wi{k)b{kfP{k)dlnk, (23) 
-00 

where the window functions Wi depend only on the ge- 
ometry of the survey and the method used to analyze 
it. Here b{k) is the bias, reflecting the fact that galaxies 
need not cluster the same way as the underlying mat- 
ter distribution, and defined simply as the square root 
of the ratio of galaxy power to matter power. Figure 1 
shows the 2dFGRS power spectrum as measured with the 
PKL eigenmode technique [8], which has the advantage 
of producing uncorrelated error bars and narrow, exactly 
computable window functions Wi (see also [68]). 

With galaxy clustering measurements, bias is the key 
caveat. On small-scales, bias is known to be complicated, 
with the galaxy power spectrum saying more about the 
galaxy distribution within individual dark matter ha- 
los than about the underlying matter distribution. We 
have therefore plotted 2dFGRS measurements only for 
k < 0.3/i/Mpc. Fortunately, a broad class of bias models 
predict that b{k) should be simple and independent of k 
on large scales [69-73]. Even if this is true, however, the 
measured large-scale 2dFGRS power spectrum is likely 
to have slightly scale-dependent bias, masquerading as 
evidence for a redder power spectrum, i.e., one with 
a smaller spectral index n. This is because the power 
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spectrum is measured from a heterogeneous magnitude- 
limited sample mixing galaxies of very different kinds. 
Most of the information about P{k) on large scales comes 
from distant parts of the survey, where bright ellipti- 
cals are over-represented since dimmer galaxies get ex- 
cluded by the faint magnitude limit. Since more luminous 
galaxies are known to be more highly biased [74,75], this 
should cause the bias to rise as fc ^ 0. With a massive 
data set like the SDSS, it will be possible to accurately 
measure how bias depends on luminosity and correct for 
this effect. 



III. BREAKING THE DEGENERACY BETWEEN 
PRIMORDIAL POWER AND TRANSFER 
FUNCTIONS 

Above we have shown how to map CMB, lensing, clus- 
ter and LyaF measurements into fc-space when the trans- 
fer functions are known. These transfer functions de- 
pend on various Late Universe cosmological parameters 
(the reionization optical depth r and the matter budget). 
To measure the Late Universe properties (these param- 
eters) and the Early Universe properties (the primordial 
power spectrum P*(A;)) independently, we must there- 
fore break the degeneracy between the two. This may 
at first sight appear hopeless, since the measurements 
involve products of primordial power and transfer func- 
tions, and there is no unique way of factoring a product 
into two terms. As will be described below, the problem 
can nonetheless be solved thanks to two separate facts in 
combination: 

1. T and the matter budget parameters affect different 
types of measurements in different ways. 

2. There is substantial overlap in fc-space between dif- 
ferent types of measurements. 
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FIG. 14. Same as Figure 9, but assuming a baryon density 
fe^Cfc = 0.07 when analyzing the data. Since this figure makes 
no assumptions about the primordial power spectrum P, (fc), the 
glaring discrepancy seen between the power spectrum inferred from 
CMB and galaxy clustering means that this high baryon frac- 
tion can be ruled out without assumptions about Early Universe 
physics. 

A picture can say more than a thousand words, and 
both of these facts are illustrated by the example in Fig- 
ure 14. It shows simulated data assuming that the con- 
cordance model of [5] is true, with the CMB mapped into 
fc-space assuming a higher baryon density, h^fli, — 0.07. 
This alters the CMB and matter transfer functions in 
quite different ways (details below), producing a strik- 
ingly wiggly P{k) inferred from the CMB. Since MAP 
and SDSS overlap by over a decade in k where this wig- 
gliness is seen, it is obvious that the two are inconsistent 
and that such a high baryon density is ruled out. This 
conclusion can be drawn without assumptions about the 
primordial power spectrum P*(fc), since this figure was 
generated without involving P*(fc), merely using mea- 
surements and transfer function parameters. 

In the next subsection, we will briefly discuss how the 
various types of measurements are affected by the Late 
Universe parameters and the relevance of this for mea- 
suring these parameters independently of P*(fc). We then 
describe how how a "chi-by-eye" comparison as in Fig- 
ure 14 can be replaced by a rigorous statistical method 
useful for cosmological parameter estimation. 



A. How Late Universe parameters affect the P{k) 
recovered from different data sets 

Although reconstruction of P{k) as in Figure 14 has 
the advantage of minimizing the amount of processing 
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applied to large-scale-structure (galaxy, lensing, cluster, 
LyaF) data the reconstructed primordial power P*(fc) 
provides better intuition for the present discussion, since 
each data set has, loosely speaking, been divided by its 
own transfer function. In contrast, the CMB data in Fig- 
ure 14 was (apart from smoothing effects) both divided 
by the CMB transfer function We{k) and multiplied by 
the matter transfer function T{k)'^. We will therefore 
center our discussion around P^{k) rather than P{k) in 
the remainder of the paper. 
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FIG. 15. Same as Figure 8, but assuming a baryon density 
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FIG. 16. Same as Figure 8, but assuming a dark matter density 
/i^fic = 0.5 — a ruled out model. 
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FIG. 17. Same as Figure 8, but assuming a cosmological con- 
stant = 0.2 — a ruled out model. 

Imagine generating large numbers of plots like Fig- 
ure 7, each one assuming different values for the Late 
Universe parameters to analyze the same measured data. 
Figure 15 shows the result for the above-mentioned case 
of a high baryon fraction, and is simply the P^j-version of 
Figure 14. Figures 16 and 17 show corresponding exam- 
ples with incorrect assumptions for the cold dark mat- 
ter density /i^ilcdm and the cosmological constant J^a- 
Before delving into details, a few basic facts should be 
noted. The LSS (cluster, lensing, LyaF and galaxy) 
points tend to shift together, since they are all sensitive 
to the matter transfer function T{k)^. These LSS points 
split apart when certain parameters are altered, how- 
ever, notably and the cosmic expansion history a{t), 
which affect the four differently. In contrast, the CMB 
points separate from the other four data types whenever 
any Late Universe parameter is changed, indeed often by 
shifting in a rather opposite direction from the others. 

The recent literature on cosmological model con- 
straints includes a bewilderingly large list of cosmological 
parameters: 

p = {t, fl]^, flA, n^, w, ujdni, (^h, .U, As, Us, a, At, nt). (24) 

These are the reionization optical depth r, the primordial 
amplitudes A^, At and tilts Ug, rit of scalar and tensor 
fluctuations, the running of the scalar tilt a, and seven 
parameters specifying the cosmic matter budget. The 
various contributions fli to critical density are for curva- 
ture r^k, vacuum energy JIa, other dark energy (with 
an equation of state w), cold dark matter Jlcdm, hot dark 
matter (neutrinos) and baryons Qh- The quantities 
cjb = h^i^h and Ucdm = h^^dm correspond to the physical 
densities of baryons and total (cold -I- hot) dark matter 
(fldm = f^cdm + f^iy), and = fl^/fldm IS the fraction of 
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the dark matter that is hot. Additional parameters that 
are often mentioned are not independent, for instance the 
total matter density Om = + ^dm and the dimension - 
less Hubble parameter h = v' (wcdm + '^b)/(l — ^^A^^k)- 

Fortunately, the underlying physics is simpler than this 
parameter profusion suggests. Ag, Ug and a are merely a 
particular parametrization of the primordial power spec- 
trum, corresponding to the Ansatz P*{k) = ^8^""+"'"'^. 
At and nt similarly parametrize the primordial tensor 
(gravity wave) power spectrum as a power law. In other 
words, only the first eight parameters in equation (24) 
are Late Universe parameters affecting the transfer func- 
tions — we refer to all of these except r as the matter 
budget parameters. The tensor parameters At and nt are 
of only marginal relevance to this paper, since they affect 
only the CMB and do so essentially only on scales larger 
than those that overlap with large-scale-structure obser- 
vations. This means that if we assume > in our 
reconstruction, the CMB measurements of P*(A;) would 
shift downwards in Figure 15, but only to the left where 
they cannot be compared with other data. Since our 
method for measuring Late Universe parameters involves 
comparing CMB with LSS data, it is therefore essentially 
unaffected by tensor fluctuations. 

A similar simplification applies to r, which also affects 
only the CMB. On the small scales where the CMB over- 
laps with other measurements, the effect of reionization 
in merely to suppress the CMB power spectrum by a 
constant factor e^"^. 

A further simplification is that Oa, fl^ and w never 
enter in any other way than as a particular parametriza- 
tion of the cosmic expansion history a{t) or, equivalently, 
of the function 
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(1 + zfn^ + (1 + zfn^ + (1 + z)3(i+«') + nA 

(25) 



where the Hubble parameter H = dlna/dt^. Various in- 
tegrals involving this function determine the growth of 
linear clustering, the brightness and angular size of dis- 
tant objects, and volume-related effects. This function 
causes merely a sideways shift in the CMB on the scales 
that LSS can probe, since the late ISW effect is impor- 
tant only on larger scales. Beyond the parameters in this 
function, the only remaining matter budget parameters 
are thus Wm, Wb, fu, specifying the physical densities of 
cold dark matter, baryons and neutrinos, respectively. 

Detailed discussions how the CMB and matter trans- 
fer functions depend on cosmological parameters can 
be found in, e.g., [76-78]. For the reader in- 
terested in more empirical intuition, the movies at 



*If the dark energy is a scalar field that can cluster (ie. 
quintessence) there could be additional effects for low Is. 



www. hep.upenn.edu/^max/cmb/movies. html arc rec- 
ommended. The key point about to take away from all 
this is that the CMB and matter transfer functions de- 
pend quite differently on the matter budget parameters, 
often in rather opposite ways. For instance, increasing 
the cold dark matter density Wcdm shifts the galaxy power 
spectrum up to the right and the CMB peaks down to 
the left. Adding more baryons boosts the odd-numbered 
CMB peaks but suppresses the galaxy power spectrum 
rightward of its peak and also makes it wigglier. In- 
creasing the dark matter percentage that is hot (neutri- 
nos) suppresses small-scale galaxy power while leaving 
the CMB almost unchanged. (The recovered points in 
our P*(fc) figures always respond in the sense opposite to 
that in these movies when assumed parameters are var- 
ied.) This means that combining CMB with LSS data 
allows unambiguous determination of the matter budget. 

Since the heights of the CMB peaks are controlled by 
the densities of ordinary (wb) and dark (wdm) matter, as- 
suming the wrong values for these parameters is seen to 
results in a wiggly i-'«(fc) from CMB in Figures 15 and 16. 
Increasing Ub boosts predominantly the odd-numbered 
CMB peaks whereas increasing t^cdm suppresses the CMB 
peaks with less of an even/odd asymmetry as well as 
shifting the peaks to the left, so the CMB points in these 
figures are seen to depart from unity in the opposite 
sense. Increasing the baryon fraction Wb/wdm produces 
larger wiggles in the matter power spectrum as well, to- 
gether with an overall power suppression leftward of the 
peak. Increasing the dark matter density Wdm pushes 
the turnover in -P(fc), corresponding to the horizon size 
at the matter-radiation equality epoch, to the right and 
thereby boosts small-scale power. There is also a side- 
ways shift in both CMB and matter clustering, since the 
h on the horizontal axis changes with the matter bud- 
get parameters. H{z) gives shifts the transfer functions 
vertically via the linear growth factor and horizontally 
(angle-diameter distance changes, and well as the h in 
the horizontal axis definition). Here is a brief summary 
of how the curves P^{k) recovered from CMB and LSS 
measurements get affected when the assumed parameters 
are changed: 

• Wb, i^cdm' cause wiggles 

• fv- boosts LSS points on small scales 

• Ok, dark energy: cause wiggles via incorrect CMB 
peak locations, vertical offset 

• r: boosts CMB points by e^"^ where they overlap 

with LSS 

Some parameters also affect the conversion of observed 
LSS data to measurements of P{k) as illustrated in Fig- 
ure 17. The cluster point scales roughly as O"-*^-^ in power 
(fi"*^'^ in amplitude), mainly because Om enters in the 
Press-Schechter prescription for halo abundance and in 
the approximation for collapse overdensity. For weak 
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lensing, a factor of enters in the definition of the 
cosmic shear power spectrum, although the final scaling 
with f2i„ is complicated by nonlinearities. All four LSS 
observables at z > must be mapped to z = 0, which 
involves a vertical shift due to clustering growth and a 
horizontal shift from the computation of comoving length 
scales, and these shifts are both given by the cosmic ex- 
pansion history H{z) of equation (25). Since these shifts 
increase with z, they are most important for the LyaF. 
Finally, the P{k) reconstruction from galaxies, clusters 
and the LyaF depend at least weakly on the baryon frac- 
tion r2b/r2m, as is evident from considering what entities 
like galaxy bias, the cluster temperature function would 
be like in the limit of zero baryons. 



E-points go high, revealing that the high baryon fraction 
assumed is ruled out at high significance from CMB data 
alone, without any assumptions about the shape of the 
primordial power spectrum. 

B. Measuring Late Universe parameters 
independently of P*{k) 

Above we described the physics that makes it possible 
to break the degeneracy between Early Universe param- 
eters and Late Universe parameters (r and the 
matter budget). We now turn to the issue of how to do 
this in practice in a statistically rigorous way. 
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FIG. 18. Simulation of the primordial power speetrum reeov- 
ered from the upcoming Planck CMB satello using unpolarized and 
polarized data, assuming a baryon density h^Qi, = 0.07 — a ruled 
out model. Note that the bias goes in opposite directions, since 
polarized and unpolarized peaks are out of phase. The simulation 
was performed using all Planck channels and the MID foreground 
model from [33]. 

CMB polarization provides another powerful and inde- 
pendent tool for breaking the degeneracy between Early 
and Late Universe parameters. The key reason for this is 
that the polarized acoustic peaks are out of phase with 
the unpolarized ones, with the peaks in the E power spec- 
trum lining up with troughs in the unpolarized (T) power 
spectrum. This means that an incorrect assumed value 
for a parameter affecting peak heights (notably ujh and 
Wcdm) causes the primordial power spectrum P*(fc) re- 
covered from T and E will to be biased in opposite direc- 
tions. This useful fact is illustrated in Figure 18, which 
shows simulated measurements from the Planck satellite 
mapped into /c-space. Note in particular the wiggle at 
k ~ 0.04/i/Mpc, where the T-points go low whereas the 



C. The basic problem 

Our basic problem is to measure a vector p of Late 
Universe parameters, say 



P = (t, t^dm, t^b, fu, f^k, ^^a), 



(26) 



independently of P*(fc). Our basic approach is to map 
all measurements into (linear) fc-space and test if they 
are consistent with one another. Repeating this for a fine 
grid of models p, we can map out the region of parameter 
space that is allowed, i.e., where the data are consistent. 

To be specific, let us consider the case of comparing 
two types of data, for instance the MAP CMB power 
spectrum with the SDSS galaxy power spectrum. The 
challenge is that although they exhibit substantial over- 
lap in scale, as seen in Figure 8, they generally have dif- 
ferent window functions. This means that we cannot 
simply subtract the two independent measurements of 
P*(fc) from each other and require the result to vanish — 
it would not vanish even in the absence of measurement 
errors, since the two experiments are probing different 
linear combinations of the underlying function P*(fc). 

Fortunately, a mathematically equivalent problem has 
already been extensively investigated in the literature, 
and we can employ the solution to test our data sets for 
consistency. The question of whether there is any power 
spectrum that is consistent with given band power mea- 
surements was studied in [79] for the case of two band 
powers and generalized to multiple ones in [80]. For n 
band-powers, the best fit power spectrum was found to 
be a linear combination of up to n — 1 delta-functions 
[80]. However, finding their locations is numerically time- 
consuming for large n, which motivated revisiting the 
problem. A fast general method was presented in [81] but 
was somewhat complicated, involving a series of eigenval- 
ues computations. Finally, an information-theoretically 
optimal method was derived in [82] for different purposes 
(comparing CMB maps), which solves our present prob- 
lem as well. This method has also been used to test 
CMB experiments for consistency [82,4]. Since it is not 
only better (in terms of statistical rejection power) but 
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also simpler than the above-mentioned alternatives, we 
will summarize it briefly below. It consists of two parts; 
"deconvolving" the effect of the window functions and 
testing the resulting two measurements for consistency. 
In summary, our method thus consists of the following 
three steps: 

1. Map the measurements into fc-space for fixed a p 

2. Deconvolve the effect of window functions 

3. Test the deconvolved measurements for consistency 

These three steps are repeated for a fine grid of p- vectors 
to map out the allowed region in p-spacc. The implemen- 
tation of Step 1 was described in Section II and we now 
turn to Step 2 and Step 3. 

1. Deconvolving the effect of window functions 

Let us model the primordial power spectrum P^, [k) as 
piecewise constant in a large number of narrow fc-bins, 
with Xi denoting the value of P, in the bin, and ar- 
range the numbers Xi into a vector x. Grouping our two 
sets of band-power measurements into vectors yi and y2, 
this means that they are related to x by 

yi=Wix + ni, y2=W2X + n2, (27) 

for some known window matrices W that are simply dis- 
cretized versions of the window functions computed in 
Section II and some noise vectors with known statistical 
properties. We assume that the measurements are un- 
biased so that (rij) = and define the noise covariance 
matrices 

Ni = (nin*), N2 = (n2n*), (28) 

In this subsection, we describe how the annoying W- 
matrices can be eliminated by computing two decon- 
volved data sets Xj, i = 1, 2, with the properties 

Xi = X + n^, (n^) = 0, (29) 

and known covariance matrices Si = (n^n^*). 

In the generic case, such dcconvolution is strictly 
speaking impossible: we cannot compute W~^yi since 
Wj is not invertible. Certain pieces of information about 
X are simply not present in y^, for instance about sharp 
features on scales much smaller than the widths of the 
window functions or about fc-scales outside the region 
probed by the observations. The basic idea in Appendix 
D of [83] is to accept that certain modes in x cannot 
be recovered, and to record this information in the noise 
covariance matrix for x^ by assigning a huge vari- 
ance to these modes. Any subsequent analysis (in our 
case consistency testing) will then automatically assign 
essentially zero weight to these modes. This is useful in 



practice since all complications related to window func- 
tions are transferred from Wj to Sj where, as we will see 
in the next subsection, they are straightforward to deal 
with. 

The method can be interpreted as combining the real 
data yi with data from a virtual experiment that is so 
noisy that it contains essentially no information, yet has 
enough information to remove all numerical singularities 

by providing independent measurements of each Xi with 
some huge standard deviation a. The final result is [83] 

X, = S,W,*NrV„ (30) 
= [W*NriWi + a-^l] . (31) 

One finds [83] that this prescription has all desired prop- 
erties as long as a is chosen to be a few orders of mag- 
nitude larger than the error bars in the real data, and 
it also has the property of minimizing the noise variance 
in the deconvolved data. If we were to choose a to be 
too small, then the virtual experiment would contribute 
a non-negligible amount of information and bias the re- 
sults. If we were to choose a to be too large, however, 
the matrix Si would contain some enormous eigenval- 
ues (since W^S~"^Wi is typically not invertible) and be 
poorly conditioned, causing numerical problems. 

2. Testing the deconvolved measurements for consistency 

To make explicit that our mapping of measurements 
into fc-space depends on the assumed Late Universe pa- 
rameters p, we will denote the deconvolved measure- 
ments and their noise covariance matrices Xi(p) and 
Si(p) from now on (i = 1,2). Given these two decon- 
volved measurements Xi (p) and X2 (p) of the primordial 
power spectrum vector x (from MAP and SDSS, say), 
we wish to test if they arc consistent. If they are not, 
this rules out the Late Universe parameters p that were 
assumed to construct them. 

Letting z denote the difference of the two power spec- 
trum measurements, 

z(p) =xi(p)-X2(p), (32) 

we consider two hypotheses: 

Hq: The null hypothesis that the assumed Late Uni- 
verse parameters p are correct. Then (xi(p)) = x, 
so that the difFcrcncc spectrum z(p) consists of 
pure noise with zero mean and covariance matrix 
S(p) = (z(p)z(p)*) = Si(p) + S2(p). 

Hi: The alternative hypothesis that the assumed Late 
Universe parameter vector p is incorrect. In this 
case, z(p) is expected to on average depart more 
from zero than under the hypothesis Hq, since both 
Xi(p) and X2(p) become biased power spectrum 
measurements and typically get biased in quite dif- 
ferent ways (Figures 15-17). 
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We will model Hi as a change in the mean, (z) = m 7^ 
0. It can be shown [81,4] that the "miU-buster" statistic 



_ z(p)*S(p)-iQS(p)-iz(p) - tr{S(p)-iQ} 



[2tr{S(p)-iQS(p)-iQ}] 



1/2 



(33) 



rules out the null hypothesis Hq with the largest average 
significance {v) if Hi is true provided one chooses Q = 
mm*. The statistic can be interpreted as the number 
of "sigmas" at which Hq is ruled out [81]. Note that 
for the special case Q oc S(p), it simply reduces to v = 
{x^ ~n)/\/2n, where = z*S(p)~"'^z is the standard chi- 
squared statistic. The null-buster test can therefore be 
viewed as a generalized x^-test which places more weight 
on those particular modes where the expected signal-to- 
noise is high. It has proven successful comparing both 
CMB maps [83,84,82], galaxy distributions [85,86] and 
CMB power spectra [4]. Tips for rapid implementation 
in practice are given in [83] . 

Any choice of Q results in a statistically vaUd test: for 
instance, the region in the Late Universe parameter space 
for which u{p) > 2 is ruled out at the 2— a level. This cor- 
responds to 97.5% significance with the usual frcquentist 
interpretation as a one-sided test if there are many sim- 
ilarly large eigenvalues of I]~^/^QS~^/^, since it makes 
the generalized distribution of z/ approximately Gaus- 
sian by the central limit theorem. 

What is the best choice of Q? The above-mentioned 
choice Q = mm* placed all weight on a single mode 
m. More generally, the test pays the greatest attention 
to those eigenvectors of Q whose eigenvalues are large. 
The standard x^-test (the choice Q = S) has the at- 
tractive feature of giving an assumption-free consistency 
test: since Q has no vanishing eigenvalues, it is sensi- 
tive to any type of discrepancies between the two power 
spectrum measurements xi(p) and X2(p). 

If the goal is to place sharp constraints on cosmologi- 
cal parameters, however, the statistical rejection power of 
the test can be boosted by placing the statistical weight 
on precisely those types of departures which correspond 
to incorrect parameter assumptions. If the rms uncer- 
tainties on the parameters can be estimated using the 
Fisher matrix [87] = dm* /dpi'S~^dm/dpj (with i 
and j running over the parameters), the choice 



Q ^ ^{F-^j 



dm (9m* 
dpi dpj 



(34) 



will have this attractive property as long as the p- vectors 
considered are fairly close to the true value — a desirable 
situation that the cosmology community will hopefully 
keep approaching as data keeps improving. 

Indeed, in this high signal-to- noise limit, the nuUbuster 
test with the weighting given by equation (34) can be 
thought of as the frequentist analog of a Bayesian likeli- 
hood analysis. To understand this we can calculate the 



expectation value of i> assuming Hi is true to lowest order 
in the parameter differences Api. To do so we take the 
expectation value of equation (33), use (zz*) = S-|-mm* 
and m w {dm*/dpi)Api and get, 



Ap*FAp 



(35) 



where Np is the number of parameters. We can compare 
this result to what would be obtained using a likelihood 
analysis. In the Gaussian approximation, the likelihood 
of the null hypothesis is given by 



C oc IS] 



-l/2g-lz*S- 



(36) 



Thus if we calculate the expected value of the logarithm 
of this likelihood assuming again that Hi is true (again 
to lowest order in Ap) we get 



(ln£) 



-Ap'FAp 



(37) 



up to an irrelevant additive constant, the usual result 
that the Fisher matrix gives the curvature of the log- 
likelihood around the maximum. This shows that on 
average and to lowest order in Ap, the (frequentist) null- 
buster statistic coincides with the log-likelihood that one 
would calculate in a Baysian framework, and so on av- 
erage one would rule out the same region of parameter 
space with both methods. 

Note that although it is tempting to perform a stan- 
dard Bayesian likelihood analysis using a "likelihood" 
similar to equation (36), it is not obvious that this can 
be given the standard interpretation in terms of Bayes' 
theorem, since it is not only (z) and S that depend on p 
(as usual), but also the data itself: z = z(p). Since there 
are no such interpretational issues with the frequentist 
null- buster test, it is therefore reassuring that the two 
approaches approximately agree. 



D. Further applications 

Above we described how requiring consistency between 
two sets of measurements (say MAP and SDSS) could 
constrain the Late Universe parameters without assump- 
tions about the primordial power spectrum P^(k). Given 
three or more types of data, an obvious extension is to 
first test two for consistency (say lensing and LyaF), then 
for the parameter subspace where they are consistent, 
combine them as in [81] and test the result against a 
third type of data. Since the LSS data share many pa- 
rameter dependencies as discussed in Section III A, it is 
natural to first let galaxy clustering, lensing, cluster and 
LyaF measurements slug it out amongst themselves and 
then compare the resulting spectrum combining all LSS 
data with CMB. 

Once p has been constrained as described above, it 
becomes possible to measure P* (k) without assumptions 
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about p for the very first time. One way to do this is to 
measure P*(fc) as in Figure 8 using the best fit p- vector, 
and then quantify the error bars and their correlations by 
recomputing (fc) for a grid of models p in the allowed 
region of the Late Universe parameter space, effectively 
marginalizing over p using the observed constraints as a 
prior. 

IV. DISCUSSION 

We have presented a method which complements the 
traditional "black box" likelihood approach to cosmologi- 
cal parameter estimation in two ways: by testing underly- 
ing physical assumptions and by improving physical intu- 
ition for where the constraints come from. We described 
how CMB, galaxy, lensing, cluster and LyaF could be 
compared directly in (linear) fc-space in Section II, then 
showed how a graphical chi-by-eye test could be trans- 
formed into a statistically rigorous method in Section III, 
providing independent measurements of Early Universe 
parameters (the primordial power spectrum P*(fc)) and 
Late Universe parameters (r and the matter budget). 
We found that requiring consistency between unpolar- 
ized CMB measurements and either polarized CMB or 
large-scale structure data is quite promising in this re- 
gard. 

Separating Early and Late Universe physics is partic- 
ularly timely given the excess in the small-scale CMB 
power spectrum recently reported by the CBI team [88] . 
We have seen that the angular scales where this excess is 
seen correspond to spatial scales around k ~ 0.2/i/Mpc 
where the power spectrum is already constrained by 
galaxy, lensing and cluster observations. This makes it 
difficult to blame the excess on the Early Universe, say 
by tilting or adding a feature in P*(fc). The alternative 
explanation in terms of contamination from discrete SZ- 
sources has been shown to be extremely sensitive to the 
power spectrum normalization on the cluster scale, tenta- 
tively requiring « 1 [89]. With our convention (★), (t| 
probes the range k w O.lTlg Qg, which according to Fig- 
ure 5 corresponds approximately to the multipole range 
£ = 2000 ± 800 if the concordance model [5] is not too 
far from the truth. The cluster normalization with 
R w 15/i~iMpc corresponds to ^ w llOOtgQp, scales al- 
ready probed by the shallower (mosaic) CBI observations 
[16]. The concordance model [5] normalized to the CMB 
data gives = 0.81 assuming r = 0.05, and it appears 
likely that quite accurate and robust erg-measurements 
based on the CMB normalization will be available down 
the road. 

This paper is not intended to the final word on testing 
physical assumptions in cosmology, merely a small step 
to be followed by many more. In this spirit, let us close by 
summarizing some of the most important things that we 
have not done and some promising directions for future 
work. 



An obvious first step is implementing our method to 
independently measure the Early and Late Universe pa- 
rameters. Next year will be an appropriate time to do 
this, taking advantage of the revolutionary precision that 
will be offered by MAP. Since this will involve working 
with a large grid of CMB transfer fmictions, the approx- 
imations described in [90] will be useful for this. 

We have made one important assumption throughout 
this paper that it would desirable to test: that the pri- 
mordial fluctuations are adiabatic. The adiabatic as- 
sumption means that the process generating the fluc- 
tuations in the Early Universe created density fluctua- 
tions without altering the density ratios of different mat- 
ter components (photons, baryons, neutrinos, dark mat- 
ter, etc.). By tinkering with these relative densities, it 
is possible to generate a variety of so-called isocurvature 
fluctuations. In addition to the familiar baryonic and 
CDM isocurvature modes, there are obscure ones, for in- 
stance a neutrino isocurvature mode where the ratio of 
neutrinos to photons varies spatially but the net density 
perturbation vanishes, and it can be shown that the most 
general case corresponds to a function P*(fc) that is not 
a scalar but a 5 x 5 symmetric matrix [91]. Although 
it has been shown that CMB polarization will help con- 
strain isocurvature modes, real progress in this endeavor 
is likely to be some way off, requiring the sensitivity of 
the Planck satellite [93,92,94]. On the bright side, these 
complications matter only when studying CMB data, so 
it is valid to compare and combine the power spectra 
measured with galaxies, LyaF, lensing and clusters at 
low (z < 10, say) redshift as described above assuming 
purely adiabatic fluctuations. 

Staying on the topic of still more general tests, there 
exist an elegant "generalized dark matter" formalism for 
describing the gravitational effects of the most general 
matter budget [95] , and this is likely to place robust con- 
straints on dark matter and dark energy as power spec- 
trum measurements continue to accumulate over a range 
of redshifts. 

Needless to say, CMB and LSS data will keep improv- 
ing dramatically in coming years, providing greater sen- 
sitivity, 2;-range, fc-rangc and fc-resolution. In addition to 
the obvious advantages of sensitivity and range, galaxy 
window functions will become narrower as the SDSS sky 
coverage improves, improving the ability to detect sharp 
features in P*(fc). For lensing, the shear power spec- 
trum contribution from a given z has delta function win- 
dows on P(fc) in the small-angle approximation, so the 
smearing in fc-space comes from projection effects and 
can probably be substantially reduced using photometric 
redshift and topography techniques. For the LyaF, the 
~ 10^ SDSS quasar redshifts should greatly improve the 
measurement errors on larger spatial scales, where one 
may hope that uncertainties associated with nonlinear 
physics are smaller. In addition to the CMB and LSS 
probes we have utilized in this paper, many more appear 
promising. For instance, the recently claimed detection 
of galaxy halo substructure [96] falls right on our concor- 
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dance curve in Figure 1 but two orders of magnitude fur- 
ther right tlian the other data points, at fc ~ 100Mpc//i. 
Searches for phase space dumpiness in the Milky Way 
halo with tidal streamers and future space-based astrom- 
etry may provide further constraints on dark matter clus- 
tering on such small scales, and additional clever obser- 
vational ideas are undoubtedly waiting to be thought of. 
The SZ power spectrum is emerging as another promis- 
ing cosmological observable, sensitive the A;-range near 
the cluster scale [97]. 

This avalanche of precision data offers an exciting chal- 
lenge to theorists in the community: to raise the ambi- 
tion level to making precision cosmology mean more than 
merely more apocryphal decimal places, placing our un- 
derstanding of the Universe on a solid foundation where 
the underlying physics has been tested rather than as- 
sumed. 
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